OLIGONUCLEOTIDE IDENTIFIERS 

Related Applications 

This application is a continuation of U.S. Non-Provisional Patent Application 
5 Serial Number 10/004,275, filed November 15, 2001, which claims priority to U.S. 
Provisional Patent Application Serial Numbers 60/248,863, filed November 15, 2000, 
60/252,650, filed November 22, 2000, Great Britain Patent Application Serial 
Number 0101054.5, filed January 15, 2001, U.S. Provisional Patent Application Serial 
Numbers 60/276,995, filed March 19, 2001, 60/302,231, filed June 29, 2001, 
10 60/326,937, filed October 3, 2001, and 60/327,089, filed October 3, 2001 . 

Field of the Invention 

This invention relates generally to methods, assays, and components for the 
rapid, high-throughput, specific and sensitive detection and analysis of biomolecular 
15 and chemical interactions, and more particularly to identifiers for identification of 
participants in these and other assays. 

Background of the Invention 

International Patent Application Serial No. PCT/US00/01504, published July 
27, 2000 as WO 00/43783 describes a variety of assays involving colloids. * 

20 International Patent Publication No. PCT/US00/01997, filed January 25, 2000 

and U.S. Patent Application Serial No. 09/631,818, filed August 3, 2000 describe 
methods, assays, and components for analyzing species associated with disease and 
for screening of candidate drugs for treatment of disease. Assays involving 
colloid/colloid interaction are described in detail. 

25 In vitro techniques that currently exist for studying protein-protein interactions 

include co-immunoprecipitation, co-fractionation by chromatography, cross-linking, 
sandwich assays and surface plasmon resonance. A disadvantage of these techniques 
is that putative binding partners must be sequentially tested which greatly limits the 
number of potential interacting proteins that can be tested. There are two reasons why 

30 experiments have to be performed sequentially. The first is a signaling problem. In a 
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typical binding assay, a single type of signal is produced when binding occurs. 
Putative binding partners must therefore be kept isolated, then tested sequentially for 
pair-wise interactions. The second reason for sequential experiments is a 
bookkeeping problem. Since it is very difficult to identify proteins, especially when 
5 at low concentrations, it is necessary to keep track of isolated and purified species, 
then test for binding in pair-wise fashion. 

In v/vo, cell-based binding assays such as the yeast two hybrid system and 
yeast mating system provide a major advantage over existing in vitro methods in that 
once a positive protein-protein interaction has been detected, the host cell, which 
10 provides the signal, contains ample DNA that codes for the protein(s) under study. As 
those skilled in the art appreciate, it is far easier to sequence DNA that proteins or 
peptides. Additionally, DNA at low concentration can be enzymatically amplified 
prior to sequencing whereas proteins cannot. This eliminates the need for tracking 
individual aliquots of purified proteins and facilitates high throughput screening to 
15 detect protein-protein interactions. 

There are, however disadvantages of in vivo protein detection systems. In vivo 
assays suffer from false positives and negatives because of the inherent redundancies 
of the biological processes upon which the assays are based. For example, the yeast 
two-hybrid system is based on a mechanism of transcriptional activation. Another 
20 disadvantage of the system is that it can only detect interactions between cell-derived 
species. Therefore, interactions between proteins and chemical species, such as drug 
candidates or chemical recognition elements cannot be detected using this method. 

While a wide variety of biological and chemical assay techniques are known, 
assays with enhanced multiplexing capability that do not sacrifice accuracy in 
25 detection would be advantageous. Therefore, an in vitro binding assay in which 

genetic material that codes for expressed proteins or chemical species is available, and 
can be correlated to a specific species after the binding assay, would provide for high 
throughput and a major advantage over existing systems. 



Summary of the Invention 

The present invention provides a series of methods, components, kits, etc. for 
use in chemical and biological analysis. Specifically, the invention provides 
5 techniques for studying binding interactions between chemical or biological species, 
such as binding interactions between proteins. The invention allows for high- 
throughput, multiplexed screening of interactions between species. That is, large 
numbers of interactions can be screened simultaneously, as opposed to those prior 
techniques in which binding partner candidates were screened sequentially. It is a 
10 significant advantage of the present invention that high-throughput, multiplexed 
screening can be conducted in vitro. 

The invention provides techniques for determination of where binding 
interactions have occurred among many possibilities of binding interactions, and rapid 
selection and identification of species that have participated in binding interactions. 
15 Another method involves allowing a chemical or biological species, 

immobilized relative to a surface, to participate in a chemical or biological interaction. 
The identification of the chemical or biological species that participated in the 
interaction is then determined by identifying an oligonucleotide identifier associated 
with the surface, optionally by identifying a unique combination of two 
20 oligonucleotide identifiers associated with each of the interacting partners. 

Identifying the combined identifiers uniquely identifies the interacting pair. - In one 
embodiment, an identifier can be an oligonucleotide that codes for a protein that it 
identifies in the assay. 

Another method involves expressing a protein with an oligonucleotide, and 
25 immobilizing the protein and the oligonucleotide relative to each other. 

Another method of the invention includes expressing a protein from a nucleic 
acid and immobilizing the protein and the oligonucleotide relative to each other. 

Another aspect of the invention involves articles. One article of the invention 
has a surface with a chemical or biological species able to participate in a chemical or 
30 biological interaction, fastened to or adapted to be fastened to the surface. An 

oligonucleotide identifier is also fastened to or adapted to be fastened to the surface. 
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As in the above method, in this aspect, according to one embodiment, the identifier 
can be an oligonucleotide that codes for a protein that it identifies in the assay. 

The invention also provides kits for biological or chemical analysis. One kit is 
defined as the article described in the paragraph above. Another kit includes at least 
5 one additional article having a surface, a second chemical or biological species 
fastened to or adapted to be fastened to the surface, and a second oligonucleotide 
identifier fastened to or adapted to be fastened to the surface. The kit can contain a 
set of oligonucleotides, or derivatives of oligonucleotides, which completely represent 
all possible oligonucleotide identifier sequence combinations that uniquely identify 

10 any two interacting partners. The kit can contain a set of nucleic acid identifiers to 
interactions that involve more than two interacting species. In another embodiment a 
kit of the invention includes a surface, a protein immobilized relative to the surface or 
adapted to be immobilized relative to the surface, and an oligonucleotide identifier 
that codes for the protein, immobilized relative to the surface or adapted to be 

15 immobilized relative to the surface. 

In another embodiment a kit of the invention includes a polymer or dendrimer 
rather than a surface. 

In another embodiment a kit of the invention includes a protein and an 
oligonucleotide identifier that codes for the protein, immobilized relative to each other 

20 or adapted to be immobilized relative to each other, and an entity carrying 
immobilized thereto a binding partner of the protein. 

Another aspect of the invention involves compositions. One composition 
comprises a protein and an oligonucleotide identifier that codes for the protein, 
immobilized relative to each other or adapted to be immobilized relative to each other. 

25 Another composition of the invention comprises a chemical or biological 

species, able to participate in a chemical or biological interaction, a linker species that 
is not a ribosome, and an oligonucleotide identifier, wherein each of the chemical or 
biological species and the oligonucleotide identifier is fastened to or adapted to be 
fastened to the linker species. The linker species can be a surface of nanoparticle, 

30 chip, polymer, dendrimer, RNA binding protein, DNA binding protein, etc. 
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It is not intended that the present invention be limited by the nature of the 
solid support. In one embodiment, the solid support is a colloid (e.g. gold colloid). It 
is also not intended that the present invention be limited by the nature of attachment 
of the ligand to the solid support. In one embodiment, said ligand is covalently 
5 attached (directly or through another ligand or binding moiety) to the solid support. In 
another embodiment, the ligand is attached non-covalently or by electrostatic or ionic 
interaction. 

In some embodiments signaling entities are useful. In such embodiments, the 
invention contemplates a variety of signaling entities described below, including but 

10 not limited to optically active entities such as fluorescent molecules and enzymes 

capable of acting on color-producing substrates. Preferred signaling entities include 
electroactive molecules, that is, molecules having an oxidation/reduction potential 
that can be determined electronically or electrochemically proximate a working 
electrode of an appropriate, conventional electrical arrangement, as signaling 

15 elements. 

It is not intended that the present invention be limited by the nature of the 
chemical or biochemical agent. A wide variety of agents and binding partners of 
those agents such as protein/protein, protein/peptide, antibody/antigen, 
antibody/hapten, enzyme/substrate, enzyme/inhibitor, enzyme/cofactor, binding 

20 protein/substrate, carrier protein/substrate, lectin/carbohydrate, receptor/hormone, 
receptor/effector, complementary strands of nucleic acid, protein/nucleic acid 
repressor/inducer, ligand/cell surface receptor, virus/ligand, etc., can be used for 
binding interactions of the inventions. In one embodiment, the agent is a ligand, 
specifically a peptide. In a preferred embodiment, the peptide is derivatized with a 

25 moiety (such as a histidine tag) that can bind to a metal chelate. In this embodiment, 
it is convenient that the solid support comprise a metal chelate and said peptide is 
attached to said solid support via binding of said moiety to said metal chelate. 

In some embodiments, cell-derived molecules, including both cell-surface 
receptors and intracellular signaling proteins, exist on or are attached to solid supports 

30 that can either be surfaces or particle-like in nature. Binding partners of these cell- 
derived proteins, which can include both known and unknown ligands as well as 



7 



putative drug candidates, are attached to surfaces and/or particle-like structures, and 
are allowed to interact with the cell-derived proteins in a manner such that binding 
between the two binding partners occurs. One of the binding partners or its attached 
support can additionally be derivatized with a detectable substance. Interacting 
5 complexes are identified using characteristics of the associated complex that 

differentiate it from the unassociated binding partners. The presence of, or a change 
in, a detectable moiety, that is either co-immobilized with one of the binding partners 
on a common solid support or directly attached to one of the binding partners, is 
detected. Molecules that disrupt a relevant interaction can be identified by detecting a 

10 loss of this signal. Interacting partners are brought to a sensing apparatus by 

confining one of the binding partners to the sensing area and allowing it to recruit the 
other binding partner, or by manipulating characteristics of the associated complex 
that differentiate it from the unassociated binding partners, or by attaching a 
recruitable element to one of the binding partners or its associated solid support. 

15 One embodiment of the invention involves recruiting an electronic signaling 

entity to an electrode using a magnetic material. This embodiment can find use in 
many assays and other techniques of the invention. In the method, typically, a 
signaling entity is provided with the ability to become immobilized relative to the 
magnetic material (which can be a magnetic bead). The magnetic material and 

20 signaling entity can become immobilized relative to each other via a variety of 

chemical and/or physical linkages described herein. For example a first species may 
be immobilized relative to or fastened to a magnetic material and a second species 
may be immobilized relative to or fastened to the signaling entity or the first and 
second species can bind to each other. The first and second species can be essentially 

25 any species described herein or known in the art for binding, and in one embodiment 
are proteins. In a preferred embodiment, the proteins are not antibodies but are, for 
example, a ligand and a cognate receptor, etc. The signaling entity can be fastened to 
an intermediate entity, such as a colloid particle, to which one of the proteins that acts 
as a binding partner also is fastened. A signaling entity also can be recruited to an 

30 electrode without use of a magnetic material. In this arrangement the signaling entity 
can be immobilized with respect to a binding partner of a species with respect to the 
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electrode, and the binding partners can be allowed to bind to each other. Essentially 
any binding partner interaction as described herein or known in the art can facilitate 
this technique. 

It is to be understood that aspects of the invention involving an 
5 oligonucleotide identifier can be used in connection with any aspects described 
herein, and that the oligonucleotide identifier has application in essentially any 
chemical or biological binding study. 

Other advantages, novel features, and objects of the invention will become 
apparent from the following detailed description of the invention when considered in 
10 conjunction with the accompanying drawings, which are schematic and which are not 
intended to be drawn to scale. In the figures, each identical or nearly identical 
component that is illustrated in various figures is represented by a single numeral. 
For purposes of clarity, not every component is labeled in every figure, nor is every 
component of each embodiment of the invention shown where illustration is not 
15 necessary to allow those of ordinary skill in the art to understand the invention. 

Brief Description of the Drawings 

Fig. 1 illustrates schematically an embodiment of a colloid particle 140 
adapted to bind essentially any chemical or biological species and also to bind an 
20 oligonucleotide identifier. 

Fig. 2 illustrates schematically a chip including a plurality of spatially- 
addressable regions, each region having a chemical or biological species (putative 
binding species) and an oligonucleotide identifier. 

Fig. 3 illustrates, schematically, another embodiment showing a chip to which 
25 one or more chemical or biological species are fastened. 

Fig. 4 illustrates an oligonucleotide identifier of the invention adapted to be 
fastened to a surface, specifically via a self-assembled monolayer- forming species. 

Fig. 5 illustrates identification of the polyamino acid tag of Figures 4-8, 
following separation from the surface of the colloid particle to which it had been 
30 fastened. 
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Fig. 6 illustrates a surface of a colloid particle to which is fastened an 
oligonucleotide identifier (Fig. 4) and a biological binding partner. 

Fig. 7 illustrates biological binding between first and second biological 
binding partners attached to first and second colloid particles, respectively. 
5 Fig. 8 illustrates separation of the oligonucleotide identifier of Fig. 6 from the 

surface of the colloid particle to which it had been fastened. 

Fig. 9 illustrates an oligonucleotide identifier and a biological binding partner, 
each fastened to a surface of a colloid particle. 

Fig. 10 illustrates two colloid particles, each carrying a biological species that 
10 biologically binds to the species of the other colloid particle, and each carrying a 
oligonucleotide identifier. 

Fig. 1 1 illustrates binding of an interaction hybridization identifier to the 
combination of the oligonucleotide identifiers bound, respectively, to the colloid 
particles of Fig. 10. 

15 Fig. 12 illustrates de-activating any non-hybridized oligonucleotide. 

Fig. 13 illustrates the result of the step of Fig. 12. 

Fig. 14 illustrates denaturization of the interaction hybridization identifier of 
Figures 1 1-13; 

Fig. 15 illustrates identification of chimeric oligo solution and thereby 
20 identification of the oligonucleotide identifiers of Figures 10-13. 

Fig. 16 shows ACV demonstration of enhanced electronic communication 
across a self-assembled monolayer, and redox signaling of protein immobilization to a 
cell surface, against a control. 

Fig. 17 shows ACV analysis of protein/protein interaction as measured by 
25 binding of a colloid to a magnetic bead. 

Fig. 18 illustrates how two binding partners can be detected through magnetic 
recruitment. 

Fig. 19 illustrates a multiplexing apparatus for applying and releasing a 
magnetic force at multiple locations on a continuous surface. 
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Detailed Description of the Invention 

International patent application serial number PCT/US00/01997, filed 
01/25/00 by Bamdad et ah, entitled "Rapid and Sensitive Detection of Aberrant 
5 Protein Aggregation in Neurodegenerative Diseases" (published as WO 00/43791 on 
07/27/00), International patent application serial number PCT/US00/01504, filed 
01/21/00 by Bamdad, et al, entitled "Interaction of Colloid-Immobilized Species with 
Species on Non-Colloidal Structures" (published as WO 00/43783 on 07/27/00), 
commonly-owned, copending U.S. patent application serial no. 09/602,778, filed 

10 06/23/00 by Bamdad et ah, entitled "Interaction of Colloid-Immobilized Species with 
Species on Non-Colloidal Structures"; and commonly-owned, copending U.S. patent 
application serial no. 09/631,818, filed 08/03/00 by Bamdad et al., entitled "Rapid and 
Sensitive Detection of Protein Aggregation" all are incorporated herein by reference. 
"Small molecule", as used herein, means a molecule less than 5 kiloDalton, 

15 more typically less than 1 kiloDalton. As used herein, "small molecule" excludes 
proteins. 

The term "candidate drug" as used herein, refers to any medicinal substance 
used in humans, animals, or plants. Encompassed within this definition are 
compound analogs, naturally occurring, synthetic and recombinant pharmaceuticals, 

20 hormones, antimicrobials, neurotransmitters, etc. This includes any substance or 
precursor (whether naturally occurring, synthetic or recombinant) which is to be 
evaluated for use as a drug for treatment of neurodegenerative disease, or other 
disease characterized by aberrant aggregation, or prevention thereof. Evaluation 
typically takes place through activity in an assay, such as the screening assays of the 

25 present invention. 

A variety of types of particles can be used in the invention. For example, 
"fluid suspendable particle" means a particle that can be made to stay in suspension in 
a fluid in which it is used for purposes of the invention (typically an aqueous solution) 
by itself, or can be maintained in solution by application of a magnetic field, an 

30 electromagnetic field, agitation such as stirring, shaking, vibrating, sonicating, 

centrifuging, vortexing, or the like. Examples include colloid particles, nanocrystals, 
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and the like. A "nanoparticle" is a particle that can be fluid-suspendable, having a 
maximum cross-sectional dimension of no more than 500 nanometers, preferably no 
more than 250 nanometers. A "magnetically suspendable" particle is one that can be 
maintained in suspension in a fluid via application of a magnetic field. An 
5 electromagnetically-suspendable particle is one that can be maintained in suspension 
in a fluid by application of an electromagnetic field (e.g., a particle carrying a charge, 
or a particle modified to carry a charge). A "self-suspendable particle" is a particle 
that is of low enough size and/or mass that it will remain in suspension in a fluid in 
which it is used (typically an aqueous solution), without assistance of for example a 

10 magnetic field, for at least 1 hour. Other self-suspendable particles will remain in 
suspension, without assistance, for 5 hours, 1 day, 1 week, or even 1 month, in 
accordance with the invention. 

"Proteins" and "peptides" are well-known terms in the art, and are not 
precisely defined in the art in terms of the number of amino acids that each includes. 

15 As used herein, these terms are given their ordinary meaning in the art. Generally, 
peptides are amino acid sequences of less than about 100 amino acids in length, but 
can include sequences of up to 300 amino acids. Proteins generally are considered to 
be molecules of at least 100 amino acids. 

As used herein, a "metal binding tag" refers to a group of molecules that can 

20 become fastened to a metal that is coordinated by a chelate. Suitable groups of such 
molecules include amino acid sequences, typically from about 2 to about 10 amino 
acid residues. These include, but are not limited to, histidines and cysteines 
("polyamino acid tags"). Such binding tags, when they include histidine, can be 
referred to as a "poly-histidine tract" or "histidine tag" or "HIS-tag", and can be 

25 present at either the amino- or carboxy-terminus, or at any exposed region, of a 
peptide or protein or nucleic acid. A poly-histidine tract of six to ten residues is 
preferred for use in the invention. The poly-histidine tract is also defined functionally 
as being a number of consecutive histidine residues added to a protein of interest 
which allows the affinity purification of the resulting protein on a metal chelate 

30 column, or the identification of a protein terminus through the interaction with 
another molecule (e.g. an antibody reactive with the HIS-tag). 
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"Affinity tag 11 is given its ordinary meaning in the art. Affinity tags include, 
for example, metal binding tags, GST (in GST/glutathione binding clip), and 
streptavidin (in biotin/streptavidin binding). At various locations herein specific 
affinity tags are described in connection with binding interactions. It is to be 
5 understood that the invention involves, in any embodiment employing an affinity tag, 
a series of individual embodiments each involving selection of any of the affinity tags 
described herein. 

As used herein, "chelate coordinating a metal" or metal coordinated by a 
chelate, refers to a metal coordinated by a chelating agent that does not fill all 

10 available coordination sites on the metal, leaving some coordination sites available for 
binding via a metal binding tag. U.S. Patent No. 5,620,850 of Bamdad, et al., 
incorporated herein by reference, describes exemplary chelates. Examples include 
nitrilotriacetic acid, 2,2'-bis(salicylideneamino)-6,6'-demethyldiphenyl, and l,8-bis(a- 
pyridyl)-3,6-dithiaoctane, or the like. 

15 "Signaling entity" means an entity that is capable of indicating its existence in 

a particular sample or at a particular location. Signaling entities of the invention can 
be those that are identifiable by the unaided human eye, those that may be invisible in 
isolation but may be detectable by the unaided human eye if in sufficient quantity 
(e.g., colloid particles), entities that absorb or emit electromagnetic radiation at a level 

20 or within a wavelength range such that they can be readily detected visibly (unaided 
or with a microscope including an electron microscope or the like), or 
spectroscopically, electroactive entities that can be detected electronically or 
electrochemically, such as redox-active molecules exhibiting a characteristic 
oxidation/reduction pattern upon exposure to appropriate activation energy 

25 ("electronic signaling entities"), or the like. Examples include optically active entities 
such as dyes, pigments, transition metal complexes, redox-active metal complexes, 
fluorescent or phosphorescent moieties (including, by definition, fluorescent or 
phosphorescent proteins such as green fluorescent protein (GFP), phosphorescent 
moieties), up-regulating phosphors, chemiluminescent entities, 

30 electrochemiluminescent entities, or enzyme-linked signaling moieties including 

horse radish peroxidase and alkaline phosphatase. "Precursors of signaling entities" 
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are entities that by themselves may not have signaling capability but, upon chemical, 
electrochemical, electrical, magnetic, or physical interaction with another species, 
become signaling entities. An example includes a chromophore having the ability to 
emit radiation within a particular, detectable wavelength only upon chemical 
5 interaction with another molecule. Precursors of signaling entities are distinguishable 
from, but are included within the definition of, "signaling entities" as used herein. 
Another example of a signaling entity is a particle that is made up of material that 
possesses an inherent signaling capability, including those materials whose signaling 
capabilities requires excitation with external energy sources. A preferred 

10 electroactive molecule as a signaling entity of the invention is a metallocene. 

Metal locenes that can operate as electroactive signaling elements in accordance with 
the invention are known. One example of a particularly preferred electroactive 
molecule is one containing a ferrocene or a ferrocene derivative group or derivative, 
such as ferrocenyl thiol (C3sH 2 4FeS); however, other organic complexes of transitions 

15 metals are also contemplated as signaling elements. 

As used herein, "fastened to or adapted to be fastened", in the context of a 
species relative to another species or to a surface of an article, means that the species 
is chemically or biochemically linked via covalent attachment, attachment via specific 
biological binding (e.g., biotin/streptavidin), coordinative bonding such as 

20 chelate/metal binding, or the like. For example, "fastened" in this context includes 

multiple chemical linkages, multiple chemical/biological linkages, etc., including, but 
not limited to, a binding species such as a peptide synthesized on a polystyrene bead, 
a binding species specifically biologically coupled to an antibody which is bound to a 
protein such as protein A, which is covalently attached to a bead, a binding species 

25 that forms a part (via genetic engineering) of a molecule such as GST or Phage, which 
in turn is specifically biologically bound to a binding partner covalently fastened to a 
surface (e.g., glutathione in the case of GST), etc. As another example, a moiety 
covalently linked to a thiol is adapted to be fastened to a gold surface since thiols bind 
gold covalently. Similarly, a species carrying a metal binding tag is adapted to be 

30 fastened to a surface that carries a molecule covalently attached to the surface (such as 
thiol/gold binding) which molecule also presents a chelate coordinating a metal. A 
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species also is adapted to be fastened to a surface if a surface carries a particular 
nucleotide sequence, and the species includes a complementary nucleotide sequence. 

"Covalently fastened" means fastened via nothing other than one or more 
covalent bonds. E.g. a species that is covalently coupled, via EDC/NHS chemistry, to 
5 a carboxylate-presenting alkyl thiol which is in turn fastened to a gold surface, is 
covalently fastened to that surface. 

"Specifically fastened" or "adapted to be specifically fastened" means a 
species is chemically or biochemically linked to another specimen or to a surface as 
described above with respect to the definition of "fastened to or adapted to be 
10 fastened", but excluding all non-specific binding. 

"Non-specific binding", as used herein, is given its ordinary meaning in the 
field of biochemistry. 

"Colloids", as used herein, means nanoparticles, i.e. very small, self- 
suspendable or fluid-suspendable particles including those made of material that is, 
15 e.g., inorganic or organic, polymeric, ceramic, semiconductor, metallic (e.g. gold), 
non-metallic, crystalline, amorphous, or a combination. Typically, colloid particles 
used in accordance with the invention are of less than 250 nm cross section in any 
dimension, more typically less than 100 nm cross section in any dimension, and in 
most cases are of about 2-30 nm cross section. One class of colloids suitable for use 
20 in the invention is 10-30 nm in cross section, and another about 2-10 nm in cross 

section. As used herein this term includes the definition commonly used in the field 
of biochemistry. 

A "moiety that can coordinate a metal", a used herein, means any molecule 
that can occupy at least two coordination sites on a metal atom, such as a metal 
25 binding tag or a chelate. 

As used herein, a component that is "immobilized relative to" another 
component either is fastened to the other component or is indirectly fastened to the 
other component, e.g., by being fastened to a third component to which the other 
component also is fastened, or otherwise is translationally associated with the other 
30 component. For example, a signaling entity is immobilized relative to a binding 

species if the signaling entity is fastened to the binding species, is fastened to a colloid 
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particle to which the binding species is fastened, is fastened to a dendrimer or 
polymer to which the binding species is fastened, etc. A colloid particle is 
immobilized relative to another colloid particle if a species fastened to the surface of 
the first colloid particle attaches to an entity, and a species on the surface of the 
5 second colloid particle attaches to the same entity, where the entity can be a single 
entity, a complex entity of multiple species, a cell, another particle, etc. In all 
embodiments of the invention, where a species is described as immobilized relative to 
another entity (another species, a surface, etc.), it to be understood that the species can 
be fastened to the entity in some embodiments, where those of ordinary skill in the art 

10 would understand that it is possible for the species to be fastened to the entity. 

"Diverse biological species" means different animals, such as mouse and 
hamster, mouse and goat, etc. 

The term "sample" refers to any cell, tissue, or fluid from a biological source 
(a "biological sample", or any other medium, biological or non-biological, that can 

15 advantageously be evaluated in accordance with the invention including, but not 

limited to, a biological sample drawn from a human patient, a sample drawn from an 
animal, a sample drawn from food designed for human consumption, a sample 
including food designed for animal consumption such as livestock feed, milk, an 
organ donation sample, a sample of blood destined for a blood supply, a sample from 

20 a water supply, or the like. One example of a sample is a sample drawn from a human 
or animal to whom a candidate drug has been given to determine the efficacy of the 
drug. 

A "sample suspected of containing" a particular component means a sample 
with respect to which the content of the component is unknown. For example, a fluid 

25 sample from a human suspected of having a disease, such as a neurodegenerative 
disease or a non-neurodegenerative disease, but not known to have the disease, 
defines a sample suspected of containing neurodegenerative disease aggregate- 
forming species. "Sample" in this context includes naturally-occurring samples, such 
as physiological samples from humans or other animals, samples from food, livestock 

30 feed, etc., as well as "structurally predetermined samples", which are defined herein 
to mean samples, the chemical or biological sequence or structure of which is a 
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predetermined structure used in an assay designed to test whether the structure is 
associated with a particular process such as a neurodegenerative disease. For 
example, a "structurally predetermined sample" includes a peptide sequence, random 
peptide sequence in a phage display library, and the like. Typical samples taken from 
5 humans or other animals include cells, blood, urine, ocular fluid, saliva, cerebro- 
spinal fluid, fluid or other samples from tonsils, lymph nodes, needle biopsies, etc. 

As used herein, "metal binding tag/metal/chelate linkage" defines a linkage 
between first and second species in which a first species is immobilized relative to a 
metal binding tag and a second species is immobilized relative to a chelate, where the 

10 chelate coordinates a metal to which the metal binding tag is also coordinated. U.S. 
Patent No. 5,620,850 of Bamdad, et al., incorporated herein by reference, describes 
exemplary linkages. 

The term "biological binding" refers to the interaction between a 
corresponding pair of molecules that exhibit mutual affinity or binding capacity, 

1 5 typically specific or non-specific binding or interaction, including biochemical, 

physiological, and/or pharmaceutical interactions. Biological binding defines a type 
of interaction that occurs between pairs of molecules including proteins, nucleic acids, 
glycoproteins, carbohydrates, hormones and the like. Specific examples include 
antibody/antigen, antibody/hapten, enzyme/substrate, enzyme/inhibitor, 

20 enzyme/cofactor, binding protein/substrate, carrier protein/substrate, 

lectin/carbohydrate, receptor/hormone, receptor/effector, complementary strands of 
nucleic acid, protein/nucleic acid repressor/inducer, ligand/cell surface receptor, 
virus/ligand, etc. 

The term "binding partner" refers to a molecule that can undergo binding with 
25 a particular molecule. Members of pairs of molecules that can undergo biological 
binding, as exemplified above, are examples. For example, Protein A is a binding 
partner of the biological molecule IgG, and vice versa. 

The term "determining" refers to quantitative or qualitative analysis of a 
species via, for example, spectroscopy, ellipsometry, piezoelectric measurement, 
30 immunoassay, electrochemical measurement, and the like. "Determining" also means 
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detecting or quantifying interaction between species, e.g. detection of binding 
between two species. 

The term "self-assembled monolayer" (SAM) refers to a relatively 
ordered assembly of molecules spontaneously chemisorbed on a surface, in which the 
5 molecules are oriented approximately parallel to each other and roughly perpendicular 
to the surface. Each of the molecules includes a functional group that adheres to the 
surface, and a portion that interacts with neighboring molecules in the monolayer to 
form the relatively ordered array. A wide variety of SAMs can be used in accordance 
with the invention, on a wide variety of surfaces, to present desired species such as 

10 binding partners, signaling entities, and the like at a surface of an article such as an 
electrode, colloid particle, or the like. Those of ordinary skill in the art can select 
from among a wide variety of surfaces, functional groups, spacer moieties, etc. for 
forming SAMs. An exemplary description can be found in U.S. Patent No. 5,620,850. 
See also Laibinis, P. E.; Hickman, J.; Wrighton, M. S.; Whitesides, G. M. Science 

15 245, 845 (1989), Bain, C; Evall, J.; Whitesides, G. M. J. Am. Chem. Soc. Ill, 7155- 
7164 (1989), Bain, C; Whitesides, G. M. J. Am. Chem. Soc. 1 1 1, 7164-7175 (1989), 
each of which is incorporated herein by reference. The formation of SAMs on fluid- 
suspendable particles such as colloid particles is described in U.S. Patent Application 
Serial No. 09/602,778, filed June 23, 2000, entitled "Interaction of Colloid 

20 Immobilized Species with on Non-Colloidal Structures", by Bamdad, et al., 

incorporated herein by reference. Certain embodiments of the invention make use 
of self-assembled monolayers (SAMs) on surfaces, such as surfaces of colloid 
particles, and articles such as colloid particles having surfaces coated with SAMs. In 
one set of preferred embodiments, SAMs formed completely of synthetic molecules 

25 completely cover a surface or a region of a surface, e.g. completely cover the surface 
of a colloid particle. "Synthetic molecule", in this context, means a molecule that is 
not naturally occurring, rather, one synthesized under the direction of human or 
human-created or human-directed control. "Completely cover" in this context, means 
that there is no portion of the surface or region that directly contacts a protein, 

30 antibody, or other species that prevents complete, direct coverage with the SAM. I.e. 
in preferred embodiments the surface or region includes, across its entirety, a SAM 
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consisting completely of non-naturally-occurring molecules (i.e. synthetic molecules). 
The SAM can be made up completely of SAM- forming species that form close- 
packed SAMs at surfaces, or these species in combination with molecular wires or 
other species able to promote electronic communication through the SAM (including 
5 defect-promoting species able to participate in a SAM), or other species able to 

participate in a SAM, and any combination of these. Preferably, all of the species that 
participate in the SAM include a functionality that binds, optionally covalently, to the 
surface, such as a thiol which will bind to a gold surface covalently. A self-assembled 
monolayer on a surface, in accordance with the invention, can be comprised of a 

10 mixture of species (e.g. thiol species when gold is the surface) that can present 

(expose) essentially any chemical or biological functionality. For example, they can 
include tri-ethylene glycol-terminated species (e.g. tri-ethylene glycol-terminated 
thiols) to resist non-specific adsorption, and other species (e.g. thiols) terminating in a 
binding partner of an affinity tag, e.g. terminating in a chelate that can coordinate a 

15 metal such as nitrilotriacetic acid which, when in complex with nickel atoms, captures 
a metal binding tagged-species such as a histidine-tagged binding species. The 
present invention provides a method for rigorously controlling the concentration of 
essentially any chemical or biological species presented on a colloid surface or any 
other surface. Without this rigorous control over peptide density on each colloid 

20 particle, co-immobilized peptides would readily aggregate with each other to form 
micro-hydrophobic-domains that would catalyze colloid-colloid aggregation in the 
absence of aggregate-forming species present in a sample. This is an advantage of the 
present invention, over existing colloid agglutination assays. In many embodiments 
of the invention the self-assembled monolayer is formed on gold colloid particles. 

25 Self-assembled monolayers can be made to be electrically conductive. As a working 
example, fig. 16 shows ACV demonstration of enhanced electronic communication 
across a self-assembled monolayer, and redox signaling of protein immobilization to a 
cell surface, against a control. 

A "self-assembled monolayer-forming species" comprises a species that, when 

30 exposed to an appropriate surface with other, like species, e.g. provided with like 
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species in an appropriate solution and exposed to an appropriate surface, will 
spontaneously form a self-assembled monolayer on the surface. 

A species "able to integrate into a self-assembled monolayer" (which can be a 
self-assembled monolayer forming species) is a species having a chemical 
5 functionality favoring participation in a self-assembled monolayer comprising the 
species and other, self-assembled monolayer-forming species with which it is not 
chemically incompatible. For example, the species may include a functional group 
selected to adhere to a surface on which the self-assembled monolayer is formed, and 
may include a remainder portion that may be approximately linear (not highly- 

10 branched), but which does not facilitate close packing. Molecules including a 

significant amount of unsaturation, for example a series of interconnected aromatic 
rings, are examples. Such a species may or may not be a self-assembled monolayer- 
forming species. Typically, species that are able to integrate into a self-assembled 
monolayer but are not able themselves to form a self-assembled monolayer will be 

1 5 able to participate in formation of and integrate into a self-assembled monolayer when 
present in an amount of up to about 50% as a percentage of overall species including 
chemically-compatible self-assembled monolayer-forming species. 

The term "self-assembled mixed monolayer" refers to a heterogeneous self- 
assembled monolayer, that is, one made up of a relatively ordered assembly of at least 

20 two different molecules. 

"Molecular wires" as used herein, means wires that enhance the ability for a 
fluid encountering a SAM-coated electrode to communicate electrically with the 
electrode. This includes electrically conductive molecules or molecules that can 
cause defects in the SAM allowing communication with the electrode. A non-limiting 

25 list of additional molecular wires includes 2-mercaptopyridine, 2- 
mercaptobenzothiazole, dithiothreitol, 1, 2-benzenedithioI, 1,2- 

benzenedimethanethiol, benzene-ethanethiol, and 2-mercaptoethylether. Conductivity 
of a monolayer can also be enhanced by the addition of molecules that promote 
conductivity in the plane of the electrode. Conducting SAMs can be composed of, but 
30 are not limited to: 1) poly (ethynylphenyl) chains terminated with a sulfur; 2) an alkyl 
thiol terminated with a benzene ring; 3) an alkyl thiol terminated with a DNA base; 4) 
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any sulfur terminated species that packs poorly into a monolayer; 5) all of the above 
plus or minus alkyl thiol spacer molecules terminated with either ethylene glycol units 
or methyl groups to inhibit non specific adsorption. A variety of molecules can be 
used for this purpose, including but not limited to poly (ethynylphenyl thiol) (i.e. 
5 Ci 6 Hi 0 S), referred to herein as MF1 ,: 

<°yH?y- • sh 

Thiols are described because of their affinity for gold in ready formation of a SAM. 
Other molecules can be substituted for thiols as known in the art from U.S. Patent No. 

10 5,620,820, and other references. Molecular wires typically conduct electronically or , 
because of their bulk or other conformation, creates defects in an otherwise relatively 
tightly-packed SAM to prevent the SAM from tightly sealing the surface against 
fluids to which it is exposed. The molecular wire causes disruption of the tightly- 
packed self-assembled structure, thereby defining defects that allow fluid to which the 

15 surface is exposed to communicate electrically with the surface. In this context, the 
fluid communicates electrically with the surface by contacting the surface or coming 
in close enough proximity to the surface that electronic communication via tunneling 
or the like, can occur. 

A "chimeric oligo solution" is an oligonucleotide sequence, such as DNA, that 

20 is simultaneously complimentary to two oligonucleotide identifiers each of which 
corresponds to two different binding partners. A complete set of chimeric oligo 
solutions represents a set of all possible combinations of interacting binding partners 
in any given procedure. 

The present invention provides methods by which a large number of proteins, such as 
25 those encoded by entire genomes of a species, can be simultaneously tested for 

interaction with any known or unknown component of a genome or with any chemical 
species. The invention also provides methods for detecting interactions involving 
proteins in which the nucleic acids, which encode them, are immediately available 
once the protein has been selected as an interacting species. 
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The invention also provides methods for detecting interactions between 
genetically encoded species and chemical species in order to identify new affinity 
reagents for biological and biochemical studies. Much of the following description 
involves a variety of methods, compositions and species, and articles for monitoring 
5 (detecting) interactions between chemical or biological species including techniques 
useful for drug screening. Major features of the following aspects of the invention 
include the following. Tools for proteomic studies including protein chips and 
particles for signaling interactions, and multi-particle systems such as two-particle 
systems. In multi-particle systems one particle can be a recruitable particle and the 

10 other particle can carry a binding partner of an agent presented by the recruitable 
particle and can also be a signaling entity or carry an auxiliary signaling entity. 
Another major area involves cell studies, especially techniques involving interactions 
between ligands and cell surface proteins and receptors. Discovery and therapeutics 
involving drugs that can effect these interactions also is described, with an emphasis 

15 on drug therapy involving angiogenesis. Specifically, cell receptor/1 igand interactions 
that can inhibit or promote angiogenesis are described. Another area involves 
detecting proteins, either in solution or on the surfaces of intact cells, for diagnostic 
purposes. 

A major disadvantage of existing in vitro binding assays is that they are not 
20 compatible with high throughput. Proteins must be sequentially tested in pair-wise 
binding assays because: 1) proteins cannot be amplified as nucleic acids can, making 
identification by sequencing after a binding assay difficult or impossible; therefore, 
the identity of each putative binding partner must be carefully tracked; 2) typical 
binding assays produce a single type of signal so that each pair to be tested must be 
25 kept in isolation so that a positive signal can be assigned to the appropriate binding 
partners. 

Among other aspects, the present invention solves these problems by 
providing a convenient method of tracking proteins that have been pooled, through 
the use of coding tags (identifiers) that are linked to putative binding species, which 
30 include proteins as well as chemical species. Methods of the invention detail 

techniques for "connecting" a coding identifier to a biological or chemical species via 
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co-immobilization on a common surface, which in a preferred embodiment is the 
surface of a particle. In another preferred embodiment, an oligo is used to identify the 
biological or chemical species, wherein the 4-bit code of a DNA uniquely identifies 
the co-immobilized species. In an especially preferred embodiment, the biological 
5 species under study is a protein that is expressed off of the encoding DNA, which 
uniquely identifies it. The invention also describes methods that facilitate the 
attachment of the identifier to a surface to which the biological or chemical species is 
also attached. In a preferred embodiment, an expressed protein and its encoding 
plasmid DNA are attached to a common particle via an affinity tag on the protein 

10 binding to a metal chelate on the particle and a DNA recognition motif contained 
within the plasmid binding to DNA-binding proteins on the particle surface. The 
invention further describes high throughput methods for detecting and selecting 
interacting partners, then rapidly identifying the interacting partners. 

A variety of techniques and components associated with various assays, kits, 

15 detection methods, etc. are described below. It is to be understood that the techniques 
of the invention involving oligonucleotide identifiers and proteins can be used in 
conjunction with any of the specific assays described herein, and these assays are 
provided by way of example only, as oligonucleotide identifier techniques of the 
invention can be used in conjunction with essentially any biological or chemical 

20 binding assay. Oligonucleotide identifier techniques of the invention are particularly 
well-suited to assays involving particles, beads, chips and colloids, in order to rapidly 
identify interacting protein partners from a pool of putative interactors, which are 
described below. 

Those of ordinary skill in the art will clearly understand where, in the 
25 following description, oligonucleotide identifier techniques of the invention can be 
used, and where oligonucleotide identifier techniques of the invention can be used in 
essentially any assay technique. 

In one aspect, the present invention contemplates interaction between chemical 
or biological agents for analysis, drug screening, or the like. The invention includes 
30 but is not limited to analyzing and/or inhibiting protein-protein interactions, protein- 
chemical species interactions, ligand-nucleic acid interactions, ligand-receptor 
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interactions, including but not limited to ligands on intact cells (growing on an 
electrode, or in solution or in suspension). The present invention contemplates a 
variety of embodiments, including the use of drug candidates, known or putative 
ligands, and small molecule drug libraries. 
5 One aspect of the invention involves oligonucleotide identifiers, which by 

definition include any number of bases (nucleotides), in which the 4-bit nucleic acid 
code is used to form a sequence that uniquely identifies some natural or synthetic 
material. This includes natural or synthetic nucleotide sequences, or derivatives of 
nucleic acids (including DNA and thiol-modified DNA, nucleotides fastened to 

10 polymer backbones, etc) that are adapted to be fastened to surfaces, that also can carry 
potential chemical or biological binding partners. The oligonucleotide identifiers can 
be short DNA sequences, for example from about 2 to about 20 bases in length, 
preferably from about 6 to about 12 bases in length. Longer oligonucleotide 
identifiers can be used as well, for example those of up to 50, or 100, or several 

15 hundred bases in length. 

According to the invention, an oligonucleotide identifier is attached to a 
surface to which a corresponding chemical or biological agent also is attached and 
which it will uniquely identify. The surface can be essentially any surface useful in 
chemical or biological analysis, including all surfaces described above, such as 

20 surfaces of particles such as fluid-suspendable particles or non-suspendable particles, 
larger surfaces such as those of chips, microarray chips, surfaces involved in 
electronic detection assays, cell surfaces etc. For example, the surface may be the 
surface of a colloid, where each colloid or set of colloids displays a single binding 
species and a single oligonucleotide identifier. Alternatively, the surface may be the 

25 surface of a spatially addressable array chip where multiple pairs of agents and 

identifiers are fastened in relatively close proximity to each other, but separated to the 
extent that they can be individually formed and analyzed. Each distinct spatial 
address of the chip displays a potential binding partner and an identifying 
oligonucleotide identifier nearby. 

30 Species of interest, which are surface-immobilized relative to their unique encoding 
identifiers, are allowed to interact with other species in solution or attached to other 
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surfaces or particles. Interacting species are isolated using any of a variety of 
techniques. The identities of the interacting species are then rapidly determined by 
sequencing, hybridizing, or otherwise determining the sequence of the attached 
identifier. 

5 Surfaces to which oligonucleotide identifiers of the invention are fastened or 

adapted to be fastened also can carry immobilized signaling entities such as those 
described herein and in International Pat. Apl Ser. No: PCT/USO 1/40801, filed 
05/25/01, entitled, "Electroactive Surface-Confinable Molecules", by Bamdad, et ah, 
incorporated herein by reference, as well as other documents incorporated herein by 

10 reference. As will be apparent from the description below, particles such as colloid 
particles can participate in oligonucleotide-identified interactions and may or may not 
carry auxiliary signaling entities. For example, in one set of embodiments colloid 
particles are used that are free of auxiliary signaling entities where the colloid 
particles themselves serve as signaling entities via color change upon agglomeration. 

15 In another set of embodiments colloid particles or other surfaces carry auxiliary 
signaling entities such as fluorescent markers (optionally different fluorescent 
markers at different wavelengths for different particles), electroactive species such as 
ferrocenes (optionally different ferrocenes with different oxidation/reduction 
potentials on different articles), etc. 

20 Fastening of oligonucleotide identifiers and chemical or biological species that 

may be binding partners to surfaces can be carried out according to any technique 
known in the art. Preferred techniques involve the use of self-assembled monolayers 
on surfaces. Self-assembled monolayer-forming species, or species able to integrate 
into a self-assembled monolayers can include chemical or biological species to be 

25 studied or oligonucleotide identifiers, and can thereby be incorporated into SAMs. 
Alternatively, species able to form or integrate into SAMs can include linkers for 
attachment to chemical or biological species or oligonucleotide identifiers, and the 
oligonucleotide can be attached to the surface after SAM formation via the linkers. 
Such linkers can include affinity tags or species that bind to affinity tags, species 

30 suitable for EDC/NHS coupling, oligonucleotide linkers, biotin-streptavidin 

interaction, species that can participate in DNA ligation techniques etc. Examples of 
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DNA ligation techniques include incorporation of a specific oligonucleotide sequence 
into a SAM, that encodes for a restriction site, and ligating a second oligonucleotide 
sequence, terminated in the same restriction site, to the original oligonucleotide 
sequence. An example of a commonly used ligase is T4 Ligase. Alternatively, a 
5 blunt-end oligonucleotide could be added using blunt-end ligation techniques. 

Preferred species for use with affinity tags include metals coordinated by chelates, for 
use with polyamino acid tags. For example, surfaces can be coated with SAMs 
exposing metals coordinated by chelates, and chemical or biological species, or 
oligonucleotide identifiers, can carry polyamino acid tags for coordination to the 

10 metal thereby linking the chemical or biological species, or oligonucleotide identifier, 
to the surface. Different chemistries, or the same chemistry can be used for linkage of 
any species or identifiers involved in the invention to a single surface. 

Referring now to Fig. 1, a surface of a colloid particle 140 adapted to bind 
essentially any chemical or biological species and also to bind an oligonucleotide 

15 identifier is illustrated schematically. Colloid 140 includes a surface 142 upon which 
is a SAM 144. SAM 144 is only partially illustrated - the SAM preferably will 
completely coat the surface 142. SAM 144 includes one species 146 that exposes, 
away from surface 142, terminating in a chelate able to coordinate a metal, or a 
chelate coordinating a metal, 148 (represented in the figure as NTA, nitrilotriacetic 

20 acid). A chemical or biological species 154, comprising a polyamino acid tag 156, 
when exposed to colloid 140, will fasten to chelate/metal 148. 

SAM 144 also includes a species 150 comprising an oligonucleotide linker 
152. An oligonucleotide species 158, including a linker portion 160 that is the 
complement of linker 152, and a section 162 defining an oligonucleotide identifier, 

25 when exposed to colloid particle 140 will fasten thereto. Thus, a plurality of colloid 
particles 140, each including an identical ly-derivatized surface can be provided as a 
kit. In an assay, different chemical or biological species 1 54 can be attached to one 
set of colloid particles to which is attached a unique oligonucleotide identifier 162. 
Different batches of colloid particles can carry different chemical or biological 

30 species and corresponding oligonucleotide identifiers. Of course, colloid particles can 
include solely species 146 where chemical or biological species and oligonucleotide 



-25 - 



identifiers each include polyamino acid tags 156, or colloid particles can include 
solely species 150 where both the chemical or biological species, and the 
oligonucleotide identifier, each includes an oligonucleotide linker 160. That is, 
oligonucleotide identifiers that uniquely identify a species attached to a surface need 
5 not be directly attached to the common surface. For example, for convenience, 
surfaces may be derivatized with a universal oligonucleotide. A second 
oligonucleotide comprised of a portion that is complementary to the universal DNA 
sequence, which is directly attached to the surface, and a second portion which 
uniquely identifies a chemical or biological species that is also attached to the surface. 

10 The arrangement of Fig. 1 can be used to provide a set of colloid particles is 

provided, each carrying an immobilized chemical or biological species such as a 
biological binding partner (e.g., a protein or small molecule), and each an 
immobilized oligonucleotide identifier. A record is made of the sequence of the 
identifier, and the identity of the chemical or biological species (potential binding 

15 partner) immobilized to the same colloid particles as the identifier. As an example of 
procedure, sets of colloids each bearing a distinct species are pooled together in 
solution and allowed to interact. This next step is an alternative to separating out 
interacting particle-immobilized species, then sequentially releasing and sequencing 
the attached oligo identifiers in order to identify interacting species. A set of oligos is 

20 added that we call "chimeric oligo solutions". Each DNA strand in this set is 
comprised of a chimeric sequence that is complementary to two oligonucleotide 
identifiers. The entire set of "chimeric oligo solutions" would contain chimeric 
sequences that represent all possible solutions to the problem of which species interact 
with each other. These "chimeric oligo solutions" are then incubated with the colloids 

25 that present putative binding partners and attached oligo identifiers. The "chimeric 
oligo solutions" that have simultaneously hybridized to oligonucleotide identifiers on 
two sets of colloids that bear interacting partners, identify which putative binding 
species interact with each other. The hybridized "chimeric oligo solutions" need to be 
separated from oligos that remain free in solution as well as from oligos that have 

30 hybridized to only one sequence identifier. Free oligos are easily removed by 

pelleting the colloids and discarding the supernatant. Chimeric oligo solutions that 
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are hybridized to only one sequence identifier can be distinguished from those 
hybridized to two enzymatically digesting the free end with enzymes that degrade 
single stranded nucleic acid strands. Chimeric oligosolutions are then released from 
the particles by any one of a number of methods including dissociation by heated 
5 water, chemical release, etc. The chimeric oligosolutions are then sequenced to reveal 
the identity of the interacting partners. The chimeric oligosolutions can also be 
enzymatically amplified (such as by PCR) prior to the sequencing step. 

In one embodiment, the invention involves an oligonucleotide identifier that 
uniquely identifies a protein wherein the oligonucleotide identifier is the very 
10 sequence that encodes the protein. Both the expressed protein and the oligonucleotide 
identifier (e.g. DNA) that encoded it are immobilized relative to each other, e.g. each 
is immobilized relative to a linker species, so that the protein is presented for binding 
studies and its oligonucleotide identifier is retrievable after selection of interacting 
pairs of proteins. 

15 The oligonucleotide identifier can be in essentially any form, e.g. plasmid 

form, as in a protein expression vector, or in a linear form, such as nucleic acids 
strands that are generated by a polymerase chain reaction (PCR). PCR generated 
fragments can be engineered to include sequences, such as a start site of transcription, 
to promote protein expression then translation. These nucleic acid templates are 

20 especially well suited for in vitro, cell-free protein expression systems such as the 
Rapid Translation System (RTS) sold by Roche Diagnostics. 

This embodiment of the invention is facilitated as follows. A protein is 
expressed from an expression vector or template. The solution in which the protein is 
expressed will then contain both the expressed protein of interest (which can be a 

25 putative binding partner) and the DNA that encoded the protein (which will serve as 
the oligonucleotide identifier). Both the protein of interest and the oligonucleotide 
identifier that encodes the protein can be immobilized to a common linker species by 
a variety of methods. Where the linker species is an article, the oligonucleotide 
identifier can be immobilized relative to the surface of the article by a variety of 

30 methods as described herein. E.g., the surface can be derivatized to present both a 
moiety to facilitate the attachment of the protein of interest as well as a moiety to 
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facilitate the attachment of the oligonucleotide identifier, as described herein. The 
protein can be expressed with an affinity tag from a DNA template that contains a 
convenient functionality that facilitates the DNA's attachment to a surface that also 
presents binding partners of the protein's affinity tag. For example, an affinity tagged 
5 protein can be expressed off a DNA template that bears biotin. When exposed to a 
surface bearing both a metal binding tag/metal/chelate linkage and streptavidin, both 
gene (a section of the oligonucleotide identifier) and gene product (protein) will be 
captured and presented on the common surface. A convenient approach is to express 
a histidine-tagged protein from a biotinylated template. Surfaces, including particles, 

10 such as colloids, are coated with heterologous SAMs bearing both NTA-nickel (the 
binding partner of the histidine tag) and biotin. The surfaces are first exposed to 
streptavidin, which has four binding sites for biotin. This surface will then capture 
and present the histidine tagged gene product as well as its encoding DNA sequence. 
As another example, proteins of interest fused to glutathione-S-transferase (GST) can 

15 be attached to entities or articles presenting glutathione. Alternatively, the protein of 
interest can be expressed as a fusion protein with thioredoxin, then attached to a 
surface that presents a binding partner of thioredoxin. GST and thioredoxin serve a 
dual purpose. First, these proteins provide a convenient affinity tag for the protein of 
interest. Secondly, proteins commonly used as fusion partners increase the solubility 

20 of the expressed protein of interest and thus increase the effective concentration of the 
protein. 

In another technique for forming an oligonucleotide identifier immobilized 
relative to a protein for which is encodes, via a linker species, the oligonucleotide 
identifier may also contain a binding site for a DNA/RNA-binding protein. This 

25 sequence is preferably inserted downstream of the gene to be expressed. The protein 
that binds to the binding site on the oligonucleotide identifier can then be immobilized 
at a surface (e.g. on a particle) which also presents a moiety for the attachment of the 
expressed protein (e.g. NTA/nickel). In one embodiment, a nucleic acid binding 
protein is immobilized on the surface and used to capture the oligonucleotide 

30 identifier. Nucleic acid binding proteins bind to DNA or RNA by recognizing either a 
specific nucleic acid sequence motif or by recognizing a tertiary structure. For 
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example, a common DNA-binding yeast protein is Gal4. It binds as a dimer to a 
specific sequence of double stranded DNA, while single stranded binding protein 
binds to single stranded DNA. Other proteins bind to structural elements of DNA or 
RNA, such as binding to cruciform DNA, hairpins or to specific RNA loops. 
5 Ideally, the protein binding site sequences inserted into the template that 

encodes the protein of interest are recognition motifs for a protein from a species 
distinct from that of the expression system. For example, if the protein is expressed 
using an E. coli based system, then the oligonucleotide identifier is attached to the 
common surface by binding to a yeast protein, such as Gal4. In this way, the protein 

10 expression system does not include or produce extraneous proteins that would 
compete for the cognate binding site on the oligonucleotide identifier. 

In an alternative approach, the protein of interest is expressed (by an 
oligonucleotide identifier) as a fusion protein with a protein fragment that has a 
binding partner that can be attached to the oligonucleotide identifier. This provides a 

15 method for attaching the oligonucleotide identifier to the protein of interest. There are 
a variety of proteins or fragments of proteins that can be genetically fused to the 
protein of interest, which also bind to small molecules which can be used to modify 
DNA. Short strands of DNA that are so modified can be used as primers in PCR 
reactions to produce expression templates from which flision proteins that capture 

20 their own oligonucleotide identifier can be produced. 

For example, a PCR product that contains sequences that encode the protein of 
interest as well as sequences that encode streptavidin can be generated using 
biotinylated primers. The resultant PCR product, which will also include elements 
necessary for transcription/translation, serves as a template for protein expression and 

25 also yields an oligonucleotide identifier that has been adapted to be fastened to the 

protein expressed via the biotin-streptavidin linkage. If desired, the streptavidin may 
be expressed as a fragment that only binds one biotin. Additionally, the fusion protein 
which contains streptavidin and the protein of interest may also contain an affinity tag 
to facilitate attachment of the protein/oligonucleotide identifier to a common surface 

30 or particle. 
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In another example, the protein of interest is expressed as a fusion protein with 
a DNA-binding protein, such as LexA, from an expression vector or template that 
contains DNA-binding sites for LexA. In this way, the gene product and gene are 
linked to each other. 

5 To facilitate the detection of binding events between proteins of interest, one 

of the proteins can be modified with a signaling entity. One way to accomplish this is 
by direct attachment of the signaling entity to one of the proteins of interest. Another 
way is to genetically fuse the protein of interest to a protein that has a signaling 
capability. For example, a protein of interest can be expressed as a green fluorescent 

10 protein (GFP) fusion protein. 

Alternatively, the proteins of interest have signaling capability when they are 
co-immobilized with a signaling entity on a common surface. For example, a protein 
of interest can be attached to a polymer to which a signaling entity such as Ru 
complex has also been attached. Binding partners are attached to recruitable particles 

1 5 such as magnetic beads and then detected by ECL methods. Similarly, proteins of 
interest can be co-immobilized on particles that also present signaling entities. For 
example a protein of interest can be attached to a particle or colloid that also presents 
signaling entities such as fluorescent or phosphorescent or redox-active moieties to 
facilitate optical or electrochemical detection, respectively. 

20 In yet another embodiment, the oligonucleotide identifier that encodes the 

protein of interest is generated such that it also possesses signaling capabilities. For 
example, the DNA expression template can be generated by PCR using primers in 
which at least some of the bases have been chemically modified with redox-active 
moieties such as ferrocene derivatives. The protein of interest and the oligonucleotide 

25 identifier (expression template) are then attached to a common surface, such as a 
colloid particle. The particles are mixed with putative binding partners attached to 
magnetic beads, and magnetically drawn to a sensing electrode where the presence of 
the redox-active molecule, hence the interaction, is detected. Alternatively, bases 
modified with a signaling functionality can be used in the polymerase chain reaction 

30 to generate an expression template that can signal. Similarly, the protein of interest is 
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expressed from a DNA template that has been modified with a fluorescent or 
phosphorescent moiety. 

In one example of synthesis and use, a nucleic acid is generated that encodes a 
histidine-tagged protein of interest (for attachment of the protein to. a surface that 
5 displays an NTA-Ni** moiety) and that also contains a Gal4 binding site for 

attachment of the oligonucleotide identifier to the common surface. The protein is 
preferably expressed in a cell-free system to minimize levels of background DNA and 
proteins. After protein expression, the reaction mixture is incubated with particles 
(e.g. colloids or beads) that present metal/chelate (e.g. NTA-Ni) moieties and Gal4 

10 for fastening of the protein and oligonucleotide identifier, respectively. The particle- 
immobilized NTA-Ni^ moieties capture and present the protein of interest (via the 
binding tag) and the particle-immobilized Gal4 captures the DNA that encodes the 
expressed protein, because it also contained Gal4 recognition sites. In this way an 
identifying oligonucleotide identifier has been immobilized relative to the expressed 

15 protein. 

These particles can then be used in a variety of assay formats. The following 
assay is compatible with detecting protein-protein interactions involving large 
numbers of possible binding partners. A portion of the putative binding partners is 
attached to a set of colloids, which also bear Gal4 for the attachment of the nucleic 

20 acid, which encoded the protein. The colloids also carry signaling entities. The 
signaling entities can be electroactive, electrochemiluminescent, optical, etc. A 
second set of putative binding partners is attached to magnetic particles. Large 
numbers of magnetic particles and colloidal particles are mixed together to allow 
binding to occur between proteins immobilized relative to the beads and particles. A 

25 magnetic field is then used to collect the magnetic particles at a sensing location 
where the signals carried on the colloids can be detected. 

As described, a gene product (protein) and the gene (oligonucleotide 
identifier) are connected to each other via attachment to a linker species. Protein 
recognition motifs (DNA or RNA sequences) are inserted into an expression (or 

30 translation) vector up- or downstream of the sequences that encode the gene of 

interest. These DNA recognition motifs facilitate attachment of the gene to a surface 
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or particle presenting the cognate DNA-binding protein. The surface also presents a 
moiety to facilitate attachment of the gene product. For example, surfaces that 
present both NTA-Ni and Gal4 will capture histidine-tagged proteins and DNA that 
bears Gal4 recognition sequences (motifs). The nucleic acids referenced here can be 
5 any nucleic acid encoding the gene of interest and having the ability to promote 

transcription or translation, including: a protein expression vector (plasmid); a linear 
piece of double stranded DNA, such as a PCR product, including those designed for 
in vitro protein expression or translation. 

The nucleic acid that encodes the gene of interest is chemically modified to 

10 facilitate its attachment to a surface or a particle. For example, a protein is expressed 
in a cell-free expression system off of a biotinylated PCR product. If the gene of 
interest has been modified with an affinity tag such as GST, both the gene product and 
the gene can be attached to a surface that presents both glutathione and streptavidih 
(or other avidin derivative). 

15 The gene of interest is expressed as a fusion protein with a protein that can be 

readily attached to the encoding gene. For example, the gene of interest is expressed 
as a LexA fusion protein and the DNA template or plasmid, used for protein 
expression or translation, also contains DNA binding sites for LexA. As another 
example, the gene of interest is expressed as a streptavidin fusion protein and the 

20 DNA template used for protein expression or translation, is biotinylated. 

The present invention allows the investigation of interactions between 
chemical or biological species, such as protein-protein binding interactions, on a very 
large scale. Large numbers of potential binding interactions can be investigated in a 
single solution. Once many potential binding partners are brought together, they must 

25 be separated to identify which were involved in binding interactions. The following is 
a description of techniques for separating species that have been involved in 
interaction from species that have not. 

As one example, magnetic in situ selection/dilution can be used, as described 
in a U.S. patent application, filed 10/03/01, entitled "Magnetic In Situ Dilution", by 

30 Bamdad, incorporated herein by reference. 
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The magnetic in situ selection/dilution technique can be used in connection 
with any format in which a potential binding partner is immobilized relative to a 
particle that can be drawn to a magnet, where once the particle is drawn to the 
magnet, whether the species has been involved in a binding interaction can be 
5 detected. For example, potential binding partners can be immobilized relative to 

magnetic particles, and species that might bind with these species can be labeled with 
signaling entities such that once a particle is drawn to a magnet the presence or 
absence of the signaling entity in proximity of the magnet can be detected. For 
example, as described in the above-noted patent application incorporated by 

10 reference, an array of electrodes can be provided on a surface, each electrode 
individually addressable, and each electrode accompanied by a corresponding 
individually-addressable magnet. A plurality of magnetic beads can be provided, 
each carrying an immobilized potential biding partner. A plurality of species that are 
putative binding partners of the binding partners immobilized on the magnetic beads 

15 can be provided, each of these species immobilized relative to a signaling entity. The 
binding partners on the beads can be the same or different, and the species that are 
their putative binding partners can be the same or different. Mixing the magnetic 
bead-immobilized binding partners with the species that are their putative binding 
partners may result in some magnetic beads immobilized relative to signaling entities 

20 (where binding has occurred between putative partners) and other magnetic beads that 
are not immobilized relative to signaling entities. Following mixture, within a fluid 
medium, of the putative binding partners, and optionally without any wash step, the 
solution can be exposed to the electrode array. Beads can be magnetically drawn to 
electrodes and determination can be made as to which electrodes have not drawn 

25 beads immobilized to signaling entities (for example, optically where the signaling 
entities are optical signaling entities or electrochemical ly where the electrodes 
corresponding to the magnetic beads generate a signal determinative of whether a 
signaling entity is or is not in proximity). Following this, individual magnetic regions 
that have not attracted beads carrying immobilized signaling entities are de- 

30 magnetized, releasing the beads, which are washed away. Magnets at which beads 

carrying signaling entities have been attracted remain magnetized during this washing 
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and removal step. Subsequent to removal, all magnets are released and the process is 
continued until, statistically, only one magnetic bead exists for each magnet. At this 
point, after a magnetic attraction step, each magnet that has not attracted a bead 
carrying an immobilized signaling entity is released, and all remaining magnets will 
5 have attracted only magnetic beads carrying a pair of binding partners indicated by 
the presence of a signaling entity at the magnet. At this point, these species are 
released and the identity of binding partners is identified via identification of 
oligonucleotide identifiers described herein. This is the technique described generally 
in the above-identified U.S. patent applications. 

10 As a specific example, a first set of gene products (proteins) and their 

respective genes (oligonucleotide identifiers) are attached to magnetic beads. A 
second set of gene products and their respective genes are attached to particles, such 
as colloids, that also bear electronic signaling moieties. The two particle populations 
are incubated together in solution and binding interactions between proteins 

15 immobilized on different particles are allowed to occur. When an interaction takes 
place between a protein on a magnetic bead and a protein on a colloid bearing 
electroactive groups, the recruitable particle (magnetic bead) becomes connected to 
the signaling particle. A magnetic field draws the complex to a sensing electrode 
where an electronic signal is transduced. The sensing electrode is an array of 

20 electrodes configured such that the recruitment of magnetic beads to each electrode 
pad is individually controlled. This can be accomplished by interfacing each 
electrode pad with an individually controllable electromagnet. That is, an array of 
individually-addressable electromagnets is fabricated (using techniques known to 
those of ordinary skill in the art). Where the electronic signaling entity is a redox- 

25 active molecule such as ferrocene, an electrode is associated with each magnet such 
that when beads are drawn to a magnet, the electrode associated with the magnet is 
cycled and if a redox-active signaling entity is immobilized relative to the bead (via 
binding partner interaction), then the redox signature of the signaling entity will be 
detected. 

30 Another example involves optical selection. In this technique, binding 

partners identified by immobilized oligonucleotide identifiers, that participate in 
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binding interactions are optically detected by any technique described herein such as 
visual or automatic observation of color change upon colloid-colloid aggregation, 
colloid "decoration" of beads upon aggregation, etc. As a specific example, 
interacting partners, attached to different particle types, can be optically selected by a 
5 variety of methods. A first set of gene products (proteins that are putative binding 
partners) and genes (oligonucleotide identifiers) can be attached to non-magnetic 
beads. A second set of gene products can be directly or indirectly modified with 
fluorescent or phosphorescent moieties. For example, the second set of gene products 
can be attached to colloids that also bear fluorescent or phosphorescent moieties. The 

10 interaction between bead-immobilized and colloid-immobilized proteins causes the 
agglomeration of the fluorescent or phosphorescent colloids onto the larger bead and 
renders it (the bead) detectable and identifiable as a bead that presents a protein that is 
participating in an interaction. In an alternative approach, the second set of gene 
products is expressed as a fusion protein with a DNA binding protein or domain and 

1 5 the is expressed or translated off of a fluorescent or phosphorescently labeled nucleic 
acid. As in the previous case, the interaction between bead-immobilized proteins and 
the fusion proteins causes the agglomeration of a fluorescent or phosphorescent 
species onto the bead and renders it (the bead) detectable and identifiable as a bead 
presenting a protein that participates in an interaction. 

20 Beads decorated with an optically detectable species can be manually isolated, 

prior to identifying the interacting species, by merely picking then analyzing the 
beads that fluoresce. 

Alternatively, the isolation of interacting species can be automated. Beads 
bearing fluorescent or phosphorescent moieties can be selected and isolated by 

25 utilizing a FACS (fluorescence-activated cell sorting) system, which is a technique 
well-known to those of ordinary skill in the art. 

Interacting particle-attached species are first isolated. After isolating 
interacting species, the attached nucleic acid sequences are identified by sequencing, 
PCR, hybridization or a combination of these techniques. Alternatively, if the protein 

30 has been expressed off plasmid DNA, the encoding DNA can be released from the 

particles, and plated onto growth media. Once new colonies are grown, these provide 
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a renewable source of the encoding DNA for analysis by any of the methods 
mentioned above (sequencing, PCR, hybridization or combinations of these 
techniques). 

An advantage of the present invention is that large numbers of proteins can be 
5 simultaneously tested to identify interacting pairs without apriori knowledge of the 
identity of either protein. Using standard techniques, libraries of complementary DNA 
(cDNA) can be inserted into protein expression plasmids or linear nucleic acid 
templates that have been modified to facilitate direct or indirect attachment to the 
encoded protein as described herein. For example, total mammalian cDN A can be 

10 inserted into plasmids that also contain sequences that encode a histidine tag and 

DNA binding sites for a yeast DNA binding protein, which would be presented on a 
particle that also presented NTA-Ni for the capture of the expressed protein. In this 
way, the expressed protein and its encoding DNA would be "attached" to each other 
via immobilization on a common particle. Similar genetic manipulations, when 

15 coupled with methods of the invention designed to detect and select interacting 
partners, enable the characterization of large numbers of interacting proteins and 
elucidation of interaction networks of entire genomes. 

Methods of the invention are also used to functionally characterize an 
unknown protein of interest. With the recent sequencing of the human genome, it is 

20 now imperative to determine the function of newly identified genes. The number of 
genes in the human genome is estimated to be about 40,000. However, there are 
protein recognition motifs that are common to several proteins. One method of 
characterizing unknown genes is to determine which proteins the gene product 
interacts with. A less complex method is to determine which of the known protein 

25 recognition motifs the uncharacterized protein interacts with. A library of 

nanoparticles can be generated that each display a different protein recognition 
module, such as a kinase domain, phosphorylase, a PDZ domain, GRB1 and 2 
domains, ERK, kringle, WW domains and the like. Uncharacterized proteins and 
their encoding DNA are separately attached to magnetic beads, then mixed with a 

30 library of colloid particles each bearing an interaction domain and its associated 
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DNA. Interacting particles are then either optically or magnetically selected, then 
identified via analysis of attached nucleic acids or other encoding identifiers. 

Fragments of DNA that encode proteins can be readily inserted into either 
plasmids or linear nucleic acid templates to facilitate expression of the encoded 
5 proteins. cDNA libraries that are inserted into a linear expression/translation template 
(for in vitro translation) offer certain advantages over insertion into plasmids. Hybrid 
nucleic acid expression templates can also be produced by generating the cDNA 
library with primers that contain both sequences that encode an affinity tag for the 
expressed protein and a functionality to facilitate attachment of both the gene product 

10 and its encoding DNA to a common surface. Linear expression templates can be 

readily modified with chemical functionalities to facilitate attachment to surfaces or 
functionalities to add a signaling capability. For example, protein encoding nucleic 
acid fragments can be genetically fused to nucleic acid strands that carry a biotin 
moiety. Biotinylated nucleic acids of any sequence are commercially available. 

15 Streptavidin has four binding sites for. Therefore, biotinylated nucleic acid 
expression templates bind to surfaces that present streptavidin. 

Libraries of cDNAs that have been inserted into plasmids offer other 
advantages. Once an interaction has been detected, the interacting species and the 
plasmids that encode them can be isolated. The plasmids can then be introduced into 

20 a host cell in order to amplify the plasmid prior to sequencing or use in additional 
assays. 

Another method of the invention involves fusing components of a cDNA 
library, for example representing the entire mammalian genome, to nucleic acids that 
encode a DNA-binding protein, or fragment thereof, and the DNA sequences to which 

25 it binds. Fusion proteins expressed from these templates will be able to bind to their 
encoding nucleic acids. 

The invention also provides methods for identifying affinity reagents for use 
in protein purification, immobilization and other general biochemical techniques. 
Because the interaction aspect of the invention is not performed in a host cell, 

30 interacting components need not be entirely generated from genetic material. In this 
embodiment, a first set of chemical species is attached to recruitable beads along with 
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a identifier that codes for the chemical identity of the immobilized chemical 
compound. A second set of genetically encoded species (preferable poly amino acids 
4-14 amino acids in length) is attached to a linker species that has a signaling 
capability. The chemical and biological components, one or both of which may be 
5 attached to surfaces, are mixed together such that binding interactions can occur. 

Methods of the invention are used to select interacting components. The components 
of each interacting pair are then identified. This can be accomplished using methods 
of the invention such as sequencing attached nucleic acid identifiers or encoding 
plasmids or by using other analytical tools, which are better suited for the 

10 identification of chemical compounds and peptides. These methods include but are 
not limited to mass spec techniques and peptide sequencing techniques. Methods of 
the invention can also be used when the surface is the surface of a chip. With 
reference to Fig. 2, in one embodiment, a set of derivatized colloids, each bearing a 
putative binding partner and identifiers, are introduced to a spatially addressable array 

15 chip that presents putative binding species in close proximity to an identifying 

oligonucleotide identifier. Chip-immobilized species are allowed to bind to colloid- 
immobilized species, then colloids bearing non-binders are washed away. A set of 
oligos (identifiers), which contains sequences complementary to all possible pair-wise 
combinations of the sequences that identify putative binding partners, is then 

20 incubated with the colloid-decorated chip. After rinsing, the oligos that have 

hybridized to the identifiers on the colloids and the chips represent identification of 
the putative binding species that interacted with each other. The "identifier" oligos 
are dissociated from the surface by any one of a number of methods including 
dissociation by heated water, chemical release, etc., and are then sequenced to reveal 

25 the identity of the interacting partners. The identifier oligos can also be enzymatically 
amplified at their specific locations while on the chip (such as by PCR) prior to the 
sequencing step. As shown, referring to Fig. 2, a chip 164 having a surface 166 
includes a plurality of spatially-addressable regions 168, 170, etc. Each region 
includes a chemical or biological species (putative binding species) 174, 176, 178, etc. 

30 Each region also includes an oligonucleotide identifier 180, 182, 184, etc. Identifiers 
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180, 182, and 184 uniquely identify chemical or biological species 174, 176, and 178, 
respectively. 

Following the reaction described above, as illustrated, one colloid particle 186 
remains immobilized at surface 166. Colloid 186 includes, fastened thereto, a 
5 chemical or biological species 188 that binds to species 174, and oligonucleotide 
identifier 190. Binding of species 174 and 188 brings identifiers 180 and 190 into 
close proximity, whereby an interaction hybridization identifier 192 binds to the 
combination of identifiers 180 and 190. Identification of identifier 192 identifies the 
sequences of identifiers 180 and 190, identifying one or both of chemical or biological 

10 species 174 and 188. 

Array chips that display a multitude of chemical or biological species that are 
putative binding partners, by and unique identifying oligos nearby, can be generated 
by a variety of techniques. One method involves forming heterologous self- 
assembled monolayers (SAMs) on surfaces that incorporate an entity that facilitates 

15 the attachment of a protein (e.g. a chelate/metal - terminated species that can 

participate in a SAM) and a second entity that facilitates the attachment of a nucleic 
acid species (e.g. a nucleic acid linker - terminated species that can participate in a 
SAM). For example, a mixed SAM can be formed from mixed thiol species that 
include a thiol-modified strand of DNA and a thiol terminated in nitrilo tri-acetic acid 

20 (NTA). NTA, when complexed with nickel, selectively captures histidine-tagged 

proteins. This heterologous SAM would then be able to capture any histidine-tagged 
protein and any strand of DNA that contains a sequence complementary to that 
displayed on the chip. Alternatively, a SAM exposing a single linking species (e.g. 
chelate/metal) can be formed, and used with both a binding partner that carries a 

25 polyamino acid tag and an oligonucleotide modified with a polyamino acid tag. The 
SAM-forming step is usually performed by incubated a gold-coated surface with 
various thiols in organic solvents. Organic solvents spread on metal surfaces. 
However, once the SAM has been formed, the steps of incubating a protein with the 
surface, then hybridizing a unique nucleic acid strand to the surface via a common 

30 "tail" that hybridizes to the surface-immobilized oligo, are both performed in aqueous 
buffer which beads up on SAM-coated surfaces. In this way, an entire surface may be 
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coated with a universal SAM, which is then "dotted" with small volumes of protein 
and DNA to generate an array chip. 

Alternatively, various species of DNA identifiers that are able to bind to a 
surface, for example via thiol modification, in aqueous solution, can be dotted onto an 
5 underivatized gold-coated surface. After some incubation period, the chip is then 

exposed to a solvent containing "filler" thiols. These filler species may also contain a 
thiol species that facilitates the attachment of proteins to the surface, which may be 
carried out in a spatially addressable way. 

Standard gene chips can also be used with the methods described herein, if 
10 they are modified to allow the placement of proteins in close proximity to an 
identifier. 

Referring now to Fig. 3, another embodiment of the invention is illustrated. A 
chip 240 includes a surface 242 to which a plurality of chemical or biological species 
244 - 250, etc., are fastened. Exposure of the surface to colloid particles 252 each 

15 carrying a chemical or biological species 254 and an oligonucleotide identifier 256 
results in binding between species 246 and 254, as illustrated. Subsequently, 
localized cleavage and identification of identifier 256, or localized PCR or 
hybridization can identify binding. Species 244 - 250 can be identical, with non- 
identical species attached to colloids, or species 244— 250 can be different with 

20 identical species attached to colloids exposed to the surface, or both species attached 
to the surface and the species attached to the colloids can be varied. 

Referring now to Fig. 4, one arrangement for fastening an oligonucleotide to a 
surface is illustrated. Fig. 4 illustrates a self-assembled monolayer-forming species 
incorporating an oligonucleotide identifier. Specifically, the self-assembled 

25 monolayer-forming species is a long-chain alkyl thiol including a restriction enzyme 
cleavage site 100, a DNA priming region 102 linked to the cleavage site, and a 5-base 
oligonucleotide identifier 104 linked to the priming region. It is to be understood that 
the priming region can be defined by any entity that can link the oligonucleotide 
identifier to the self-assembled monolayer-forming species and also promote 

30 sequencing as will be described below with reference to Fig. 5. It is also to be 
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understood that the arrangement of Fig. 4 is by example only, and other techniques for 
linking an oligonucleotide identifier to a surface can be used. 

Fig. 6 illustrates incorporation of the species of Fig. 4 into a self-assembled 
monolayer onto a surface, specifically a surface of a colloid, along with 
5 immobilization of a biological binding partner to the same surface. A colloid particle 
106 has a gold surface to which thiols will bind and upon which thiol-contained self- 
assembled monolayer-forming species will form self-assembled monolayers. A self- 
assembled monolayer 108 is formed on a surface 1 10 of colloid 106, including both 
the species of Fig. 4 and an immobilized biological binding partner (protein) 112. As 
10 illustrated, species 1 12 is fastened to a self-assembled monolayer-forming species that 
is incorporated into SAM 108. Specifically, the species is linked to the self- 
assembled monolayer-forming species via a metal binding tag/metal/chelate linkage 
114. 

Fig. 7 illustrates, schematically, colloid particle 106 carrying binding species 
15 112 and a second colloid 118 carrying a chemical or biological species 120 which is a 
biological binding partner of species 1 12. As illustrated, species 1 12 and 120 are each 
protein, although as would be understood by those of ordinary skill in the art, other 
binding species can be used. For example, any of a variety of proteins, peptides, or 
other species can define species 1 12 and 120, and species 112 and 120 can be fastened 
20 to the colloid particle via other affinity tags. The oligonucleotide identifier and/or 
chemical or biological species also can be fastened to the surface of the colloid via 
any of a variety of affinity tags, or a carboxylic acid thiol via EDC/NHS coupling. 
Alternatively, a species of interest can be attached to a thiol for direct attachment to 
the colloid. 

25 Self-assembled monolayer-forming species, as illustrated, can include long 

carbon chains such as 1 1 carbons or greater. 

Once species 1 12 and 120 are allowed to participate in a binding assay 

(including any assay described in the documents incorporated herein by reference 

such as bead coloration assays, colloid-colloid assays, etc.), if one of species 1 12 or 
30 120 is identified as a binding partner in the assays, the identity of which would be 

desirably known, then the identity of the species can be uncovered as follows. For 
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purposes of this discussion it is assumed that colloid 106 carries a known binding 
partner 1 12 and colloid 1 1 8 carries a species 120 that was not known to be a 
biological binding partner of species 1 12. Thus, once identification of the binding 
between species 1 12 and 120 is known (for example via aggregation of colloid 
5 particles), where a variety of other colloid particles carrying species other than 120 
were involved in the assays and may have bound to species 112, then the identity of 
species 120 is desirably determined. With reference to Fig. 8, a restriction enzyme is 
added to cleave oligonucleotide identifier 104 from colloid particle 1 18 (along with 
the priming region 102). Subsequently, with reference to Fig. 5 an oligonucleotide 

10 primer 122 complementary to the priming region 102 is added, and normal PCR- 
based sequencing, or other standard sequencing, is performed on the cleaved 
oligonucleotide to decipher the sequence of the oligonucleotide identifier. Standard 
fluorescent sequencing can be carried out, for example. Once the sequence of the 
oligonucleotide identifier 104 is identified, it in turn identifies species 120 as that 

15 species that had been bound to colloid particle 1 18 to which identifier 104 also had 
been bound; identifier 104 had been correlated to species 120 prior to running of the 
assay. 

Another technique for use of oligonucleotide identifiers is described now with 
reference to Figs. 9-15. Fig. 9 illustrates an oligonucleotide identifier 124 which is 

20 part of a self-assembled monolayer-forming species 125. Identifier 124 can be 

fastened to and form a part of SAM-forming species 125 via any technique, and need 
not include a restriction enzyme cleavage site or priming region 100 and 102, 
respectively, as illustrated in the figures above. Fig. 9 also illustrates identifier 124 
and a chemical or biological species 126, each fastened to the surface of a colloid 

25 particle 128 via self-assembled monolayer-forming species. That is, a SAM on the 

surface of colloid 128 includes both identifier 124 and species 126. Identifier 124 and 
species 126 can be fastened to colloid 128 in any manner known to those of ordinary 
skill in the art, including any manner described herein. Referring now to Fig. 10, 
colloid 128 is brought into proximity with a second colloid particle 130, which carries 

30 its own oligonucleotide identifier 132 and its own chemical or biological species 134. 
If chemical or biological species 126 and 134 bind, for example if they are biological 
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binding partners, then oligonucleotide identifiers 124 and 132 will be brought into 
close proximity with each other. As an example of an assay shown in Fig. 10 a 
plurality of colloids 128 each carrying a plurality of binding partners 126 is provided 
and mixed with a plurality of colloids 130, each carrying a different species that may 
5 or may not be a binding partner of species 126. Colloid 130 carries species 134 which 
is a binding partner of species 126, as determined by the aggregation of colloid 
particles 128 with colloid particles 130, for example. The binding of species 126 and 
134 having been determined, it is desirable (in this example) to determine the identity 
of species 134. 

10 Alternatively, a large number of colloid particles carrying a wide variety of 

potential binding partners can be admixed, where a variety of different binding 
interactions may occur. 

In any event, once binding between species 126 and 134 has occurred (e.g. via 
detection of a color change from pink to blue as aggregation of colloid particles 

15 occurs), then the identity of species 126, or 134, or both is determined. Referring now 
to Fig. 1 1, identifiers 124 and 132 together define a sequence that may be 
complementary to an interaction hybridization identifier 136. Where this is the case, 
identifier 136 will bind to the combination of identifiers 124 and 132. This will occur 
when a variety of interaction hybridization identifiers are added to the assay, each 

20 corresponding to a different potential combination of oligonucleotide identifiers 

fastened to colloid particles. If only two species such as proteins or small molecules 
are being assayed for binding, then only one complementary sequence would need to 
be added. 

Subsequently, all non-bound oligonucleotide is de-activated, e.g. by adding a 
25 DNAase that degrades single-stranded DNA (Fig. 12). This eliminates any 

oligonucleotide identifiers present on non-interacting colloids that did not participate 
in a binding event, and any auxiliary, non-bound interaction hybridization identifiers 
(138 and 140). Fig. 13 illustrates the result of this step. 

With reference to Fig. 14, interaction hybridization identifier 136 is removed 
30 from and isolated relative to identifiers 124 and 132 via, for example, denaturization 
(by boiling or addition of salt or Triton solution, etc.). Then, identifier 136 is 
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sequenced (Fig. 15). With the identification of species 136, the identity of identifiers 
124 and 132 is determined, and thereby the identity of one or both of species 126 and 
134 can be determined (identifier 124 had been correlated to species 126 and 
identifier 132 had been correlated to species 134 prior to running of the assay). 
5 Example: Demonstration of Control of SAM permeability to electrons 

This example demonstrates the ability to form a SAM including enhanced 
electronic communication. The SAM is formed on a surface that includes a mixture 
of a first, tight-packing species and a second species of different molecular structure 
that enhances the permeability of the SAM to electronic communication. A defect, or 

10 opening, is formed in the SAM allowing fluid to which the surface is exposed to 
communicate electrically with the surface. Specifically, certain small sulfur 
containing compounds having disruptive structures relative to the SAM as a whole 
were stably incorporated into a SAM, and greater permeability to electrons was 
demonstrated. This example demonstrates that a surface can be made electrically 

15 relatively conductive, and then support cell growth. 

A water-soluble ferrocene derivative was dissolved in the electrolyte solution: 
lOOmM solution of ferrocenedicarboxylic acid in 500uM NaC10 4 . The working 
electrode was a gold-coated electrode derivatized with a SAM comprised of varying 
amounts of 2-unit molecular wire (MFI). The height of the peak at a characteristic 

20 ferrocene potential was plotted as a function of molecular wire density. As a negative 
control, a gold-coated electrode was derivatized with an insulating SAM comprised of 
100% tri-ethylene glycol terminated thiol. This system was used to test the 
conductivity of electrodes modified with a panel of sulfur-containing compounds. 
The compounds were dissolved in DMF at 50% candidate compound and 50% tri- 

25 ethylene glycol terminated thiol. Electrodes were derivatized as described in Example 
1. SAMs were formed on gold chips from 500 micromolar triethylene glycol- 
terminated thiol and 500 micromolar of either mercaptobenzothiazole or 2- 
mercaptoethyl ether in DMF. The chips were clamped between a flat substrate and a 
1 ml capacity silicon gasket. A solution of ferrocene dicarboxyllic acid was dissolved 

30 in 500 micro molar NaC104 and placed in the silicon gasket with a Ag/AgCl 

reference electrode and a Pt auxiliary electrode. The gold chip was connected as the 
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working electrode. The system was analyzed by ACV. The magnitude of the current 
peaks, resulting from the ferrocene in solution communicating with the electrode, was 
an indicator of the ability of the trial compounds to make the SAM more permeable to 
electron flow by creating defects within the SAM. 
5 Example: Detection of Protein-protein interactions 

This example demonstrates the utility of a colloid particle having an 
immobilized signaling entity and an immobilized protein. (See Fig. 18) 

Histidine-tagged Glutathione-S-Transferase (GST-His) was attached to NTA- 
SAM-coated colloids, displaying 40uM NTA-Ni and lOOuM ferrocene-thiol. 

10 Commercially available magnetic beads presenting protein A were coated at 1/10 
binding capacity with anti-GST antibody, added at a 1 :5 ratio to the GST-colloids, 
and measured on a 50% MF-1 SAM-coated electrode, which was placed on top of a 
magnet. The magnet pulled the magnetic beads onto the electrode surface to form a 
thick, visible precipitate. The GST-colloids were brought down to the electrode 

1 5 surface by the interaction with the GST-antibody on the magnetic beads to give a 
current peak at approximately 280mV. Two negative controls were run, one where 
GST was not attached to the colloid surface, and another where the GST antibody was 
not attached to the magnetic beads. Neither negative control gave a current peak. 
Fig. 17 plots the results of this demonstration. Solid line represents interaction 

20 between GST-His-presenting colloids and anti-GST/Ab on magnetic beads. Open 

circles represent magnetic beads presenting the antibody incubated with colloids that 
did not present GST. Closed circles represent beads not presenting the antibody, 
incubated with colloids that presented GST. 

Prophetic Example: Massively Parallel Analysis of Protein-Protein Interactions: 

25 Elucidating the Interaction Map of the Human Proteome. 

The following prophetic example describes how to perform massively parallel 
analysis of protein-protein interactions, which is particularly useful when proteins are 
as yet uncharacterized. Here, this method is used to elucidate the protein interaction 
map of the human proteome. A subset of, or the entire set of, proteins of the 

30 proteome is expressed with affinity tags to facilitate attachment of the expressed 
protein to sets of particles. Particle-immobilized proteins are pooled together and 
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allowed to interact. Interacting pairs are selected from the pooled mixture by a 
reiterative magnetic selection/dilution process. Following the selection/dilution 
process, the identity of interacting partners is determined. The selection step reduces 
the complexity of the problem by eliminating the need to analyze non-interacting 
5 proteins. 

A diagram illustrating how two binding partners can be detected through 
magnetic recruitment is provided in Fig. 18. As shown, nanoparticle 9 is attached to 
electroactive signaling entity 10. An immobilized chemical or biochemical species 1 1 
is also immobilized to the nanoparticle 9, as is oligo identifier 12 which is thereby 

10 associated with species 1 1. A second chemical or biochemical species 14 is 

immobilized on magnetic bead 13. Second oligo identifier 15 is also immobilized 
with respect to magnetic bead 13 and thus is associated with second species 14. As 
binding occurs between species 1 1 and 14 (as shown in Fig. 18), a single hetero- 
particle forms that can then be magnetically drawn to sensing electrode 17 by 

15 electromagnet 16. Detection may be facilitated by signaling entity 10. 

In a more specific example, proteins and their encoding DNA molecules are 
indirectly connected to each other by co-immobilizing both on a common particle or 
bead, wherein each particle (or bead) presents a single protein species and its 
encoding DNA. Connecting the expressed protein to its encoding DNA expedites the 

20 identification of each set of interacting proteins after the selection/dilution process. 
Each protein and its encoding DNA are immobilized on two different kinds of 
particles: a recruitable particle and a signaling particle. The sizes of the particles are 
also different such that smaller signaling particles can form satellites around each 
larger recruitable particle. In this example, each protein and its encoding DNA are 

25 immobilized on a single 4-10 micron magnetic bead as well as on a multitude of fluid 
suspendable nanoparticles, that are 4-40 nm in diameter and bear electroactive 
signaling entities. When a first species on a magnetic bead biologically interacts with 
a second species on a signaling nanoparticle, the recruitable particle becomes 
"connected" to the signaling particle. These hetero-particle-complexes are then 

30 magnetically recruited to a sensing electrode (see Fig. 19), where they can deliver a 
signal, such as an electronic or electrochemical signal. To facilitate multiplexed 
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analysis, the particles are magnetically attracted to an electrode array that has a 
number of individually addressable electrode pads. Beneath each electrode pad is an 
individually controllable electromagnet, such that the magnetic field above each pad 
can be selectively turned off and on. However, magnetic particles that are not 
5 connected to signaling particles, which cannot deliver a signal, may also be recruited 
to the same electrode pad that delivers a positive signal. These non-signaling 
magnetic beads are electromagnetically released from the pad and washed out of the 
interaction reservoir. Signaling nanoparticles that do not interact with species on 
magnetic particles remain in a homogeneous suspension and do not deliver a signal 

10 but are also washed out of the interaction reservoir. Both of these purging functions 
are accomplished by maintaining the magnetic fields beneath electrode pads that 
deliver a positive signal, to retain interacting complexes, while the magnetic fields 
beneath pads that do not deliver a positive signal are driven to zero to release non- 
interacting magnetic beads. A port 150 (Fig. 19) is opened and fluid is washed out of 

15 the interaction reservoir carrying away non-interacting magnetic beads and non- 
interacting nanoparticles. The port is closed, all magnetic fields are driven to zero and 
more buffer is added through a second port 160 along with mechanical agitation to 
resuspend and redistribute the particles. All the magnetic fields beneath all the 
electrode pads are turned on again and the selection/dilution process is repeated until, 

20 statistically, each pad that delivers a positive signal contains a single magnetic bead 
bound to a multitude of signaling nanoparticles. 

To determine the identity of interacting proteins, an array of magnetic pins, 
whose dimensions correspond to that of the electrode array is juxtaposed over the 
electrode array and the magnetic fields beneath the entire electrode array are driven to 

25 zero such that each magnetic pin captures a single hetero-particle complex. The 
loaded pin array is dipped into a multi-well plate, of compatible dimensions, each 
well of which is filled with solution containing DNA amplification reagents. The 
respective encoding DNA sequences (immobilized on the nanoparticles and the bead) 
are amplified by PCR or similar technique and sequenced to reveal the identity of 

30 each set of interacting proteins. 
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Prophetic Example: Preparation of Proteins for co-immobilization on surfaces that 
also present coding identifiers. 

A DNA sequence that encodes each protein member of the proteome is 
inserted into a bacterial protein expression vector. The expression vector carries an 
5 affinity tag, (His>6 in this case, tandem repeats of Gal4 consensus sequences, and 2 
sequences that flank the protein identification sequence, to which PCR primers can 
bind. The histidine-tag facilitates the attachment of the expressed protein to the 
particle. The Gal4 consensus sequences act to tether the encoding DNA to the 
particle via the interaction between the recognition motif and a particle-immobilized 

10 yeast DNA binding domain, which in this case is a GST-Gal4 fusion protein. The 
DNA binding domain of Gal4 (aa' 1-100) binds to the consensus sequence 
CGGattAgAagcCgCCGAG and the GST binds to a glutathione moiety on the particle. 
Proteins are separately expressed in a cell-free translation system to reduce the 
abundance of irrelevant proteins and cell debris. Following protein expression, each 

15 expression mixture contains the encoding DNA and the expressed protein. Each 

protein expression mixture is divided into 2 aliquots. A single magnetic bead (4-10 
microns in diameter) is added to a first aliquot and a quantity of NTA-glutathione- 
SAM-coated nanoparticles is added to a second aliquot. Particles are pelleted and 
washed to remove protein that is not particle-bound. Particles and beads are pooled 

20 together in subsets of 1000 species per pool and subjected to magnetic 

selection/dilution and electrochemical analysis. In this manner, unidentified proteins 
can be bound to the same bead or particle that also binds the corresponding encoding 
DNA. 

In examples were colloid particles (nanoparticles) are used, they can be 
25 prepared as described in the above-referenced international patent publication nos. 
WO 00/43791 and WO 00/43783 
Prophetic example: Electrochemical Analysis 

An electrochemical analyzer from CH Instruments (Austin, TX) is used to 
detect interactions between species immobilized on magnetic beads and species 
30 immobilized on colloids that also bear redox active metals. The instrument is 

modified to facilitate multiplexed detection. In this case, the redox active metals are 
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ferrocene derivatives. Pads of the electrode array are individually addressable and act 
as the working electrode. In this case, the pads are gold-coated and derivatized with 
conductive self-assembled monolayers. A Ag vs. Ag/Cl reference electrode is used 
with a Pt auxiliary. Electrodes are scanned using Alternating Current Voltammetry 
5 (ACV) with a 25m V overpotential at a frequency of 10 Hz.. 

Prophetic Example: Design of Electrode Array Sandwiched between Individually 
addressable Electromagnets 

Electrode arrays 100 (Fig. 19) having 300-500 electrode pads 1 10 are 
constructed by plating gold over Nf"". Electrode pads are 50-500 microns on edge 

10 and are sandwiched between sets of individually addressable Helmholtz 

electromagnets 120 such that a magnetic field gradient can be generated to recruit, 
then hold magnetic beads at the pad surface (see Fig. 19). The direction of the current 
is reversed to drive the magnetic field to zero when it is desirable to release magnetic 
beads from the surface to wash away or redistribute. To ensure that the interaction 

15 reservoir is thermally isolated from the electromagnet arrays so that heat does not 

denature proteins in solution, a layer of insulative material 130 is placed between the 
electromagnets and the interaction reservoir 140. 

Prophetic Example: Calculation of Protein Sets and Electrode Pad Number 

To generate the protein interaction map of the entire proteome, one needs to divide 

20 the proteome into subsets, which are then tested for interaction with every other 
subset. Assuming that there are about 50,000 proteins of interest, the proteome is 
divided into 50 sets of 1000 proteins each. Each group of 1000 proteins is then tested 
for interaction with every other group of 1000, resulting in 50X50 matrix or 2500 
separate experiments. The number of proteins in each subset determines the number 

25 of electrode pads in each array. If we assume that each protein has a single binding 
partner, then each protein has a 1/50 chance of finding that partner when tested for 
interaction with one of the 50 subsets of proteins. However, each protein probably 
has on average 5 relevant binding partners, increasing the probability of finding a 
binding partner within a subset to 1/10. That means that for 2000 proteins in one 

30 pooled interaction mix, 200 will deliver a positive signal, which implies that the 
electrode array should have 300-500 pads. Low-level signals from non-specific 
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binding events are minimal because of competitive inhibition by relevant binders. 
However, the occurrence of false positives is minimized when a signal threshold is 
set, wherein signals below the threshold are counted as negatives. Relative affinities 
are determined by comparison of the degree of interaction of a first binding species 
5 and a second binding species with a third target protein to which the first and second 
bind. 

Prophetic Example: Determining the Binding Partners of a Single Target Protein with 
a Large Pool of Candidate Binding Partners 

This prophetic example describes how to identify proteins, from a large pool 

10 of putative binding partners, which interact with a single target protein. Proteins from 
the large pool are prepared as described above in Example 30, and immobilized only 
on signaling nanoparticles. The target protein is immobilized on a set of magnetic 
beads. Because the identity of the target protein is known, it is not necessary to co- 
immobilize its encoding DNA. Interacting partners are selected by electrochemical 

15 analysis as described above. 

Prophetic Example: Selection of Interacting Protein Partners by FACS Analysis 

This example describes how to identify the binding partners of a single target 
protein. The target protein is immobilized on a set of beads that are 4-25 microns in 
diameter. Putative binding partners, which may be prepared as described above or 

20 generated from a cDNA library, are co-immobilized along with their encoding DNA 
onto nanoparticles that bear fluorescent signaling moieties. When a bead- 
immobilized protein interacts with a species immobilized on nanoparticles, the bead 
becomes decorated with fluorescent nanoparticles and can be isolated by FACS 
(fluorescent activated cell sorting) analysis after which the attached DNA of each 

25 interacting species is sequenced to identify the binding partners. 

Those skilled in the art would readily appreciate that all parameters listed 
herein are meant to be exemplary and that actual parameters will depend upon the 
specific application for which the methods and apparatus of the present invention are 
30 used. It is, therefore, to be understood that the foregoing embodiments are presented 
by way of example only and that, within the scope of the appended claims and 
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equivalents thereto, the invention may be practiced otherwise than as specifically 
described. 

What is claimed is: 



